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Abstract. After discussing the relevance of statistical physics in molecular recognition processes, we present 
a schematic model for ligand-receptor association based on an Ising chain. We discuss the possible behaviors 
of the affinity when the stiffness of the ligand increases. We also consider the case of flexible receptors. 
A variety of interesting behaviors is obtained, including some affinity modulation upon bond hardening 
of softening. The affinity of a ligand for its receptor is shown to depend on the details of their rigidity 
profile, and we question the possibility of encoding information in the rigidities as well as in the shape. An 
exhaustive study of the selectivity of patterns with length n < 8 is carried on. Connection with other spin 
models, in particular spin glasses is mentioned in conclusion. 

PACS. 87.10.Vg Biological information - 82.35.Gh Polymers on surfaces; adhesion - 68.43.De Statistical 
mechanics of adsorbates 



1 Introduction 

■ Molecular biology and biochemistry differ from chemistry 
, and physics by the very high specificity of the interactions 
and the processes that they aim at describing. Words like 
functions or shapes are used instead of molecules, atoms, 
forces or fields. These biomolecules are perceived as ca- 
' pable of processing information (mutual recognition) and 
i performing dedicated actions (switches or catalytic reac- 
• tions) . It takes only two orders of magnitude in size, from 
0.1 nm to 10 nm, to abandon a world of thermal chaos 
and vibrations, and to enter a world of dedicated agents 
and reliable procedures [I]. 

It was suggested by E. Fisher that selective interac- 
' tion and binding between molecules were primarily a con- 
\ sequence of their complementary shapes, which has since 
i been known as the lock and key paradigm. These specific 
' interactions between highly complementary moieties ac- 
counts for the specialized and efficient action of enzymes 
and often explains how drugs work at the molecular level. 
The lock and key paradigm was then extended to include 
final conformational changes that may happen upon bind- 
ing, a mechanism known as induced fit [2 . Since a receptor 
and its ligand have two complementary shapes, any struc- 
tural change results in decreasing the binding properties 
of the pair. 

The tremendous increase of known 3d molecular struc- 
tures (NMR and X-ray scattering) |3n4| and the ever- 
growing computing capacities has made of the search for 
complementary ligand-receptor interaction a very intense 
and competitive research field. Numerical approaches ba- 



sed on the lock and key principle are commonly known as 
docking algorithms. The issue of these strategies depend 
on how successfully are Coulombic forces, hydrogen bond- 
ing, solvation properties, electronic densities. . . accounted 

for mil]. 

As the size of the interacting bodies increases, it is 
natural to question whether statistical physics still plays 
a role in these highly optimized recognition processes. In 
cells, most molecular interactions are subtly balanced in 
order to achieve reversibility and to prevent the occur- 
rence of irreversible aggregation. Entropic contributions 
may help to achieve this balance. 

One of the current goal of computational biochemistry, 
still out of reach, is to predict realistic ligand binding sta- 
bilization energies with an accuracy of about 1 kcal.mol"^ 
(1.6 ksT), compared with experimental measurements [7]. 
The quenching of conformational degrees of freedom upon 
binding, and the subsequent entropic changes must be ac- 
counted for when computing thermodynamical association 
constants with such accuracy. 

The immune system provides unrivaled cases of specific 
mutual recognition. For instance, antibodies are special- 
ized proteins which recognize and bind in an extremely 
specific and accurate manner to foreign bodies invading 
a living organism. A recent bioinformatic study of inter- 
actions between T-cells and the major histocompatibility 
complex (MHC) supports the view that selective interac- 
tions between peptides may owe more to a delicate balance 
among many weak additive interactions, rather than to a 
strong complementary and exclusive mutual binding [S]. 
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Also striking is the phenomenon of allostery. Some pro- 
teins have their function subordinated to the presence of 
an effector. An historically famous example is the tran- 
scription regulation of Lac-operon, for which it was demon- 
strated by Monod and Jacob that the fabrication of the 
lactose digesting enzymes in e.coli was conditioned to the 
presence of a significant amount of lactose in the envi- 
ronment of the bacteria [9|. Modern biology teaches us 
that in the absence of lactose, the protein Lac-repressor 
binds to a stretch of dna, and prevents the expression of 
the genes downstream. When lactose molecules bind to 
-Lac-repressor, the protein shape changes, and it looses 
its ability of binding dna, enabling the expression of the 
genes under its control. The dna and lactose binding sites 
are located on distinct regions of the repressor protein, 
suggesting an action at distance caused by the presence 
of lactose molecules. Recently, the idea of a simple con- 
formational change of allosteric molecules has been chal- 
lenged. By studying a schematic mechanical model of Lac- 
repressor, R. Hawkins and T.C.B MacLeish estimated the 
contribution of internal, vibrational degrees of freedom, 
i. e. a change in protein stiffness induced by the ligand [TO] . 
They concluded that positive or negative binding entropy 
changes AAS were associated to changes (hardening or 
softening) in the effective spring constants used in their 
mechanical model of repressor proteins. It is precisely this 
idea of stiffness and entropic modulation of the binding 
site efficiency that motivates the present work. 

There is, as a matter of fact, a deep and formal connec- 
tion between statistical physics, recognition, binding, and 
information theory. When a ligand binds selectively to a 
patterned substrate, it accomplishes some kind of decoding 
and reads a piece of information conveyed by its target, 
the more conspicuous case being the association of comple- 
mentary base pairs in dna-dna or dna-rna duplexes which 
is the cornerstone of genetic information processing. In the 
following Sections, we intend to show that a simple Ising 
spin chain can be turned into an elementary model for the 
binding of a ligand molecule onto a patterned receptor, 
in the limit where both thermal fluctuations and inter- 
nal entropic degrees of freedom are relevant. Within this 
framework, one can tune the interactions between ligand 
and binding site, as well as the internal stiffness of the lig- 
and, and we also consider flexible binding sites. The binary 
nature of the receptors makes them natural information 
carriers. After deflning the affinity and selectivity of a lig- 
and for its receptor in this situation, we discuss how much 
dependent are the affinities and selectivities on the stiff- 
ness parameters. Then, we proceed by giving examples of 
non monotonic behaviors of the affinities with increasing 
stiffnesses. We exhibit a case of decreased affinity upon lo- 
cal stiffening of the ligand, reminiscent from Hawkins and 
McLeish results. We show that there are affinity biases 
between receptors with similar shapes but different local 
rigidities. We flnally perform an exhaustive comparison of 
all pairs of patterns up to a length of 8 monomers, and dis- 
cuss their intrinsic ability to reliably encode information, 
which is found to decrease with their length. In the follow- 
ing sections, the word receptor will be used with the same 



meaning as "binding site", i.e. an object the size of the 
ligand that binds to it. This linguistic shortcut must not 
occult the fact that in many realistic cases, the receptor 
is a much bigger object than the ligand, and the bind- 
ing site only constitutes a subpart of it. This schematic 
model does not pretend to accurately describe a realistic 
experimental situation. However, despite its simplicity, it 
already displays a fairly rich phenomenology which may 
flnd a counterpart in some real cases. 

Using Ising-like models for modeling selective bind- 
ing is not a new proposal. Indeed, Schmid, Behringer and 
coworkers introduced and performed intensive statistical 
studies of models with very similar Hamiltonians [1111121 
[T5] . Their model describe the contact between hydropho- 
bic and polar patches belonging to too opposite and com- 
plementary binding sites, and is presented as a coarse- 
grained approach to hydrophilic-hydrophobic interactions 
that are central to both protein folding and protein-protein 
interactions. Their spin variables describe the contact be- 
tween the two rigid moieties and account for short range 
local rearrangements of the coarse-grained residues. A cou- 
pling J between "spin variables" is interpreted as a coop- 
erativity property, while we associate ours to a stiffness 
parameter. They model 2d small rectangular patches while 
we discuss more schematic Id interacting chains; our at- 
tempt to encode information in the stiffness pattern in 
addition to the spatial shape is not found in their work. 
The model introduced is Section |3| is similar to their ap- 
proach, but the general approach of Section |3] is original. 



2 Statistical physics of surface-bound 
receptors binding 

Let us consider a number of ligands L in contact with a 
single binding site M linked to a rigid surface. A balance 
is reached resulting in an equilibrium association constant 
M the exchange: 

£froo + 3Vtfroo ^ ^ ' ^Vfbound, (1) 

from which the probability of observing a ligand 

bound to M obeys: 

3^£,m%^(1-P£-m)-P£.m, (2) 

with brackets [.] denoting molar concentrations, and C*' a 
molar standard reference concentration, making the equi- 
librium association constant dimensionless, for instance 
= ImoLL^-^. This is a generalization of the law of 
mass action that one would write for the complexation 
equilibrium of two species £ , M in solution. 

^£,M[''^frcc]-[Mfrco] — [-C • Mbound]-C'°. (3) 

The equilibrium constant depends a lot on the solvation 
(hydration) of £ and M, the ionic content of the solu- 
tion and all the details of close range interactions between 
species. 
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As our current goal is to emphasize the role of the in- 
ternal degrees of freedom of £, we disregard all solvent 
related interactions by making a kind of ideal solution 
assumption, and considering that all ligand-receptor in- 
teractions are short ranged. This assumption on the sol- 
vent is equivalent to saying that all bound and unbound 
conformations of the ligand receptor pair have the same 
solvation free-energy. The mutual affinity of such a pair 
comes entirely from shape and stiffness considerations. 

It becomes possible to express the equilibrium constant 

jy,- as a partition function ratio : 



j^a _ '^AV ^£-M bound 
^ Stt^ Z£ frco-^M free 



£.M 



(4) 
(5) 



Equation Q is a particular instance of the equilibrium 
association constant derived and presented as eq. (13) in 
ref. [13]. "Na is the Avogadro number and V'^ is the vol- 
ume occupied by one mole in the reference concentration 
state C". Z£ free stands for the Boltzmann-Gibbs sum over 
all the internal conformations of £ , with fixed orientation 
and center of mass. ^Mfree is a similar configuration inte- 
gral over the internal conformations of M when L and M 
are apart and not interacting. We assume that the con- 
figuration integral of the bound complex L ■ M reduces to 
a product wZi;.M bound, where Z^.m bound represents the 
sum over all internal configurations of L in contact with 
M (with fixed centers of mass and orientations) and the 
volume V corresponds to all the positions of the center 
of mass of the ligand L relative to the receptor M that 
are considered as forming a bound state. In the context 
of bimolecular associations, v should range up to a few 
angstrom cube (10^'^" m'^). Note that for rigid receptors 
^MfroG equals 1 by construction. 

The association constant is obtained by setting p^.M ~ 
1/2 in equation One can then argue that the partition 
sum per free ligand 



8^2 



free Zm be 



equals the partition sum of the bound complex 



vZz 



M, bound- 



(6) 



(7) 



In writing these expressions, we neglected the specific vol- 
ume change occurring during the binding process. In other 
words, we assume that the Gibbs AG and Helmholtz AF 
thermodynamic quantities coincide. 

Equation ^ links the thermodynamical affinity con- 
stant that can be experimentally determined and the con- 
figuration integrals .^£.m, bound, ^i:froc and ^Mfree that are 
the main focus of this work. Equation Q can be also writ- 
ten as: 

AG" ~ AF° = (AU) ~ TASc, (8) 

with Z\F(°) = -kBT\n{Xl -^) and where {AU) desig- 
nates the average enthalpic change of moieties L and M 



upon binding, caused by their change in conformation 
and mutual interaction, while ASc represents the corre- 
sponding change in configurational entropy. This expres- 
sion stands a particular case of 



Z\G° = (AU) + (AW) - TASc, 



(9) 



demonstrated in refs. [71 I141IT5] . where (AW) accounts for 
the contribution of an explicit solvent to the formation of 
the ligand receptor pair. In aqueous solutions, both hydro- 
gen bonding and hydration forces originate from specific 
interaction with the solvent. Strictly speaking, solvent me- 
diated interactions are associated with (AW) in eq. 
However, it is to some extent possible to take them into 
account by means of an effective hamiltonian and treat 
these interactions as if they were part of the direct inter- 
action term (AU). 

We define the affinity of L for M as e^"|^ (eq. O. 
For a given number of ligands L and binding sites M, the 
affinity controls the fraction of bound pairs of molecules 
(adsorption isotherms) and an increased affinity leads to 
an increase of bound pairs. However, when many patterns 
Ml, M2. . . compete for the same ligand, the affinity is not 
a good assessment of how exclusive is the binding of L for 
a given M. This is why one needs a relative selectiv- 
ity parameter for comparing the respective behaviors of a 
ligand for a target (matching pattern M) and for a decoy 
(mismatching pattern W): 



Sr(-C,W) 



bound Zw free 



C -W bound 



^JA free 



(10) 



or equivalently = ^'c m/^'c w We will be also inter- 
ested in the absolute selectivity, when comparing the 
affinity of the matching pattern with the affinity of a com- 
plete set of decoys: 



Sa(-£) = 



:'i:-Mbound 
■^Mfree 



1 



E 



Zc. 



Wbc 



^Wfre 



(11) 



if the ligand is given a choice between all the possible 
patterns, or 



inf 



Zl 



Mbound 



z- 



Mfr< 



Zl 



Wbound 



(12) 



if only one decoy W is present at a time. The inf operator 
of eq. ([T^ runs over all the possible patterns W that are 
different from the matching pattern M, and picks up the 
best competitor of M, i.e. the one for which the affinity 
difference is the lowest. Eq. ([TTl) is better suited for assess- 
ing the affinity of a ligand for its matching pattern, while 
all the other possible competitors W / M are simulta- 
neously present. This definition of the selectivity depends 
crucially on the set 7 of allowed patterns W. Changing this 
set CP results in changing the selectivity parameters §a.r- 
Using a subset 5" of CP naturally leads to higher selectiv- 
ities. In practice, a large number of poor affinity decoys 
can eventually beat a good ligand-receptor pair fl6j. 
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3 A minimal model for flexible ligand and 
rigid receptor 

We now introduce a model containing the basic compo- 
nents of the above discussion: matching, internal degrees 
of freedom, stiffness, information, and we search for phe- 
nomenon such as stiffness dependent afhnity and matching- 
decoding. 

The ligand L is modeled as an articulated chain of 
n monomers. Each bead i is allowed to occupy only two 
positions labelled by a binary variable Si — ±1 and the lig- 
and can adopt 2" distinct internal conformations. Ligand 
stiffness is enforced by means of next nearest neighbors 
couplings Ji^i+i — ± J, in the spirit of the original spin- 
glass model by Edwards and Anderson PTIITS] . 

The signs of the couplings {Jij} define the ground 
state shape (native shape) of up to a trivial two- fold 
degeneracy, while the moduli J — \ Ji,j\ describe the en- 
ergetic cost associated to bending distortion of the ligand 
(strain). We assume that ligands have a well defined ori- 
entation, say from left to right, and we do not consider the 
possibility of left-right reversal. This can be justified con- 
sidering that biopolymers (proteins, nucleic acids) always 
display such an orientation along the chain. Finally we 
also disregard the possibility of lateral shift between ligand 
and receptors, as occurring for instance in the hybridiza- 
tion of dna oligomers. This physically relevant situation 
increases significantly the combinatorics of the association 
and efficient algorithms have been designed to tackle these 
alignment problems |19| . In this work we purposely focus 
more on the thermodynamical stability of two molecules 
that are prepositioned in the right conformation, or which 
possess a unique, non-degenerated optimal relative confor- 
mation. The outcome of this schematic model is expected 
to be relevant on length scales of about 1 nm between con- 
secutive "monomers" , large enough to invoke some coarse- 
graining of the underlying molecules, but small enough to 
preserve the importance of entropic, conformational de- 
grees of freedom. 

We now introduce a symbolic notation to describe the 
ground state of these flexible molecules. For that purpose, 
one must distinguish between open, free end molecules 
and cyclic, closed end molecules. Open ligands with n 
monomers have 2" internal configurations. There are 2"~^ 
possible ground states, each one of them being doubly de- 
generated due to up-down symmetry. To fully describe 
the shape and ground state of a ligand, we denote by the 
letters u and d the position, respectively up or down, of 
the first monomer on the left. We then associate a p to 
each antiferromagnetic coupling constant J > 0, and a m 
to each ferromagnetic one J < 0. A symbol (o) is added 
at the end to signal an open chain. The ligand £ rep- 
resented on the left of Figure [T] is thus ascribed to the 
symbol £ =:u/ppp(o). The corresponding Ising configura- 
tion reads si = +1, S2 = —1, S3 = +1, S4 = —1, and the 
coupling constants are J12 — J23 = J?,a = J ■ 

Cyclic ligands with n monomers have spin si and s„ 
coupled with a term JniSiSn- Cyclization is not really jus- 
tified by the initial molecular association problem, it is 



here just a convenient way to simplify calculations and 
get rid of boundary effects, giving to all monomers the 
same importance and simplifying the interpretation of re- 
sults. Cyclic ligands of length n have the same number of 
configurations as open ligands, but it requires n coupling 
constants to fully determine their ground state. Cyclic lig- 
ands with an odd number of antiferromagnetic p couplings 
are "frustrated" , meaning that no ligand configuration 
can satisfy simultaneously all the constraints imposed by 
the Ji j. The ground state conformation is then at least 
four-fold degenerated. Cyclic ligands with a even num- 
ber of p have a well defined two-fold degenerated ground 
state. For instance, the unfrustrated cyclic ligand with the 
same shape as represented on the left of Figure [T] is coded 
as L' =u/pppp(c), where the suffix (c) reminds of the 
cyclic character of the molecule. Ligand L" =u/pppm(c) 
is frustrated with no well defined ground shape. In this 
schematic approach, the use of open or cyclic ligands is 
essentially a matter of convenience, as they generate qual- 
itatively similar results. 

In the same way, the patterned receptor (binding site) 
is represented by a sequence of binary values bi, 1 < i < n, 
each one taking a value ±1. When the ligand comes in 
contact with the receptor, it gains a negative stabilizing 
energy ~A whenever the monomer position Si and the 
receptor value bi match. We do not give a penalty to a 
mismatch situation bi ^ Si, but this could just be done 
by shifting negatively of the total configurational energy. 
The coupling constant A represents a short range interac- 
tion, possibly mimicking hydrogen bonding or hydropho- 
bic patches. In both cases, the effective contact param- 
eter A may depend on temperature. The resulting total 
"Hamiltonian" describing the ligand and the receptor in 
close contact is: 

n n 

^r{Si} = ^ (^J^^i+lSiS^+l^ ~ ^'^'bi ■ (13) 

i=l 1=1 

The sum runs until n' ~ n — 1 for open chains, and n' — n, 
with coupling J„,„+i — Jn,i for cyclic chains. As Sgb = 
(1 -I- sb)/2 for Ising variables, the Hamiltonian: 

" A " nA 

^r{St} [J^a+lS^S,+ l^ ~ "2 ^ (^'^0 ~ ~Y ' ^^'^^ 

i=l 1=1 

assumes the form of a random field Ising spin glass with 
both quench bond disorder Jij and quenched random mag- 
netic field —Abi/2. However, contrary to usual disordered 
systems studies, we do not perform here any average over 
the quenched random bonds, as these couplings contain 
the relevant information. The related partition function, 
expressed with the inverse Boltzmann factor /?, is 

^£-M bound = 

v-^ (nPA , pA^, ^\ 

2^ exp — — |-2^(-/3Jj,,+iSjS,+i)+ — 2_^{s^h) 1 . 

{Si=±l} \ 1 = 1 i = l / 

(15) 
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Meanwhile, Zjvtfroo 
of (HSl with A = 0: 



1 and Zz^frcc is a special instance and 



-^£froo = ^ exp ^(-^Ji^j+iSiSi+i) , (16) 
{si=±l} V i=l ) 

= 2" [cosh(^ J)" ± sinh(/3 J)"] , 

result valid for cyclic chains, with a sign + without bond 
frustration, and a sign — otherwise. 

Finally, from the binding free energy AF, defined as: 



(1 + a^)j^ + 4(q^ + + flS) + 2a^j- 
2a4j4 + 4a2 + 2a4 + 4a6 + 2a4j-4 + 1 



■,8- 



Z\i^ = -keTln 



^£-M bound 



^£frcc^Mfroc 



(17) 



one deduces the partial enthalpic AU and entropic —TASc 
contributions by numerically differentiating with respect 
to /3. 



AU 
-TAS, 



djpAF) 
dp ' 
AF - AU, 



(18) 
(19) 



where it is assumed that A does not depend on tempera- 
ture (enthalpic contribution). 

To describe the shape of rigid receptors, it suffices, in 
principle, to enumerate the values bi. However, anticipat- 
ing the case of fiexible receptors that will be considered 
in the coming section, we use for receptors the same con- 
vention as for ligands, i.e. a first letter u or d, followed 
by a list of couplings p and m. Receptors with matching 
ground state are called M, W being associated with those 
with mismatching ground states. 

Let us illustrate the preceding section with £=u/pppp(c) 
M=u/pppp and W=u/pmpm. The coupling constants of L 
are {>/i,2 = J2,3 = Jza — "^4,1 ~ ^^i^ matching mo- 
tif corresponds to &i = 63 = 1, 62 = &4 = — 1 and the 
mismatching motif to fei = 64 = 1, 62 = ^3 = — 1- The cal- 
culation of the partition functions by enumeration of the 
16 configurations of £, or with a transfer matrix method 
gives: 



§r(a, j) = 

- / - 'iu- - -t- 1U — r^u" '/ * ^ 1 -r 

(22) 

We are interested in assessing the role of the stiffness 
parameter j. In Figure [2j we observe that the affinity of 
the ligand L for the matching pattern M increases mono- 
tonically with j . Figure |3] represents the variation with j 
of the thermodynamic potentials AF, AU and —TASc. 

At the opposite, the stiffness j reduces the affinity of £ 
for the mismatching pattern W, as represented in FigureHl 
with the thermodynamic functions shown in Figure [S] In 
addition, one notices that the special case j = 1 repre- 
sents a soft ligand which can adapt to any pattern. Quite 
naturally, the selectivity between W and M is 1 for j — 1 
and tend towards a finite value for j ^ 00 (Figure [6]). The 
maximal selectivity depends on the short range contact 
parameter a and is reached as soon as j > a. 

We conclude that stiffness is always favorable when 
the shape of a ligand and a receptor agree, but becomes 
unfavorable is a mismatch is present. When ligand and 
receptor shapes almost agree but not perfectly, there must 
be an optimal compromise between a very soft ligand j = 
1, which precludes any selectivity at all, and a very hard 
ligand j ^ 1 which excessively penalizes the mismatches. 



Z 



£M bound 



(1 



4{e 



fiA , 2I3A 



(20) 



4 Flexible receptors 

The next step is to consider flexible receptors M: we ex- 
pect then soft ligands to beat stiff ligands, as they will 
better fit the various configurations of M. In our model, 
M and L play a dual role and it becomes possible to treat 
ligand and binding site (receptor) on the same footing, 
by inserting coupling constants K between the "spins" 
hi. This situation arises when two molecules with similar 
weight and structure bind together (Figure [7]) • 



n' n n' 



+e3'3^) + 2e 



(23) 



^£Wbound = 264/3 V''^ -t- 4e^^ -I- 26^^^ 

+4e3^^ + 2e^f'\-^^-' + e^f"^ + 1. 



From now on, we assume that k^T sets the energy scale, 
and introduce the dimensionless coupling constants a — 
exp(/3^/2), j — exp(/3J). Affinity and selectivity are ra- 
tional fractions of a and j . 



The problem can be naturally solved for cyclic lig- 
ands and receptors with 4x4 transfer matrices. When all 
the coupling constants have same absolute magnitude, we 
define = •ni\K\, k = exp(^|ii'|), Ji^t+i = ei|J|, 

j = exp(/3| J|) where rji and are ±1, to find: 



e2;,M(«,j) 



(1 + a^)f + 4(a2 + a* + a^) + 2a^j 
12 + 2j4 + 2j-4 



4 ,--4 



^£,M bound - a"Tr n^(aj!fc) 



(24) 



(21) 



Zl frcc^M free = 4" [cosh(/3 J)" ± sinh(/3 J)"] (25) 
•[cosh(^i^)"±sinh(/3if)"]; 
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(the sign ± depending on the bond frustration along the 
chains) with noncommuting matrices defined as: 

/ aj'k'^ j-^k"^ fk-'J ai-^k-^\ 
rj^icv) _ I j^'k'^ a-^fk'' a-^j-^k-"^ fk-^ 1 
^(aj.fc)- y-k-v a-^j-'^k-'^ a-^fk'' j-'k'J " 

\aj~'k-'> j^fc-'' j-'k'> afk'i J 

(26) 

In computing the trace, one actually performs a sum- 
mation over the 4" internal configurations. Thermody- 
namic quantities AF, ASc and AU, given by equations 
(fT7|) . (|T8)) and (fT9|) . are then obtained by numerically dif- 
ferentiating the transfer matrix results. One notices that 
the transfer matrix is entirely built from dimensionless 
parameters a, j and k. The numerical differentiation of 
f3AF with respect to /3 assumes that A, J and K do not 
depend on temperature, leading to da/d/3 = /3~^aln(a), 
dj/d/S = /3"ijln(j) and dft/d/3 = /3-iftln(fc). As a re- 
sult, the thermodynamic quantities derived from equa- 
tions ([TT)). (flSl) and (flQl) are automatically expressed in 
units ksT. Non purely enthalpic contributions to the con- 
tact energy parameter A could also be included in this 
numerical scheme by using a different prescription for the 
derivative da/d/3. 

The external random field bi which was quenched for 
rigid receptors is now annealed (the random bonds still 
quenched), and we checked that for large values of K 
(namely k = exp{f3K) > 5) the result for a flexible lig- 
and receptor pair tends to the predictions for the rigid 
receptor. One can easily convince oneself that there is no 
difference between a short rigid receptor and a short flex- 
ible receptor with doubly degenerated ground state (i.e. 
no frustration) for which the condition k ^ j 1 holds. 
In this limit, .^M,frco — 2, a factor which also appears in 

.M, bound; 

leaving unchanged. In practice, we re- 

garded fc = 10 as sufficient to reach the rigid situation. 
This corresponds to an energy gap of 3 ksT between the 
receptor ground state and its first distorted state. Indeed, 
all the results regarding rigid receptors presented in this 
study were actually obtained by setting k to large values 
such as fc = 10 and applying eq. 

To calculate the partition function of a flexible, open, 
ligand receptor pair, one replaces the last matrix T'^^^'J^ 
by a matrix T(^a,i,i) representing the freely oscillating ends. 
Formula (|24p becomes 

^£,Mbound = a"Tr (r^a.ui) J[ T^^jS^ ; (27) 

The transfer matrix formalism can be modified if one 
wishes to pick-up a particular bond coupling Ji^i+i and 
assign to it a value different from the usual J. In our 
numerical implementation of the transfer matrix product, 
we use a 5-letters alphabet {p,m, P, M, .} to describe a 
ligand pattern L. 

— Characters p and m are respectively used for e = 1 and 
e = -1. 

— Character . denotes a vanishing coupling constant J = 

U - 1). 



— Characters P and M represent e — 1 and e = — 1, but 
with a different (usually larger) magnitude of J or K. 

Symbols P and M code for some localized hardening 
of ligand and receptors. Symbol . loosely connects two ad- 
jacent and stiffer domains of L. More complex scenarios 
can be considered, but all are subject to the same limita- 
tion, which is that these transfer matrices can deal only 
with nearest neighbor couplings. 



5 Ligand stiffness and affinity 

We now provide a list of examples illustrating various be- 
haviors. All curves represent the affinity variations when 
the ligand stiffness is increased from j = I (soft ligand) 
till j = 8, i.e. a 2 k^T activation barrier associated to 
local conformational change (spin reversal). 

Uphill nonmonotonic affinity: 

Ligand L =u/ppp .ppp(o) vs M =u/ppmmppm (Fig- 
ure [5]) 

The ligand L is made of two loosely connected adjacent 
domains ppp. The resulting affinity is nonmonotonic, 
first increasing, then decreasing (Figure |9l a feature 
previously seen in Figure 21). This behavior emerges 
from a competition between the matching subdomain, 
favored by large values of the stiffness parameter j and 
the mismatching subdomain whose affinity decreases 
with j. In this particular case the mismatching domain 
forces the overall affinity to decrease below its starting 
point, a maximum being reached around j ~ 1.6. Such 
a behavior illustrates the concept of optimal stiffness, 
where the association of a ligand with a (more) rigid 
receptor requires some tuning of its average rigidity. 

Downhill nonmonotonic affinity: 

L =u/ppppppp(o) vs W =u/ppmmppni (Figure [TU]) 
For a = 2 and = 10 (rigid pattern), the affinity curve 
is not monotonic with j, first decreasing, then increas- 
ing (Figure llip . The affinity is minimal for a certain 
value j ~ 1.6. This behavior in enhanced when a is 
increased and reduced when the receptor stiffness k 
is reduced, as seen in Figure [T21 The interest of this 
situation is to be the exact opposite of the preceding 
situation, with a local affinity minimum for j ~ 1.6. 

— Unbinding upon local hardening: 

Ligand L =u/ppPpp(o) vs M =u/ppmmp (Figure [T5)) 
The receptor shows a slight mismatch located under a 
region in L which is locally more rigid (stiffness J > J) 
than the average coupling. Increasing J while keep- 
ing J constant should hamper the capacity of L to 
accommodate the mismatch, and lead to lower affin- 
ity. This is precisely what is seen in Figure [TU The 
effect is seen for open chains (j = 1.5, j = 2.) and 
cyclic chains (j = 2). It is reduced if the average chain 
stiffness is larger (j = 3). If one admits that the stiff 
bond P is under control of an external agent or effector. 
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the issue is that the affinity of the ligand for its tar- 
get decreases even though their shape are not altered. 
Figures [TSl and \W\ show respectively the energetic and 
entropic contributions to the changes in affinity asso- 
ciated with increasing the stiffness parameter j. One 
can read from these data the associated relative en- 
tropy change AAS = ASQ) - AS{j). Ref [TOl sug- 
gests that a positive AAS can be associated to the de- 
creased affinity of Zac-repressor for dna in the presence 
of lactose. This change in AAS arises from increasing 
one spring constant and decreasing another one, in an 
harmonic model of Zac-repressor. In our case, because 
our model is not harmonic, changes in AAU and AAS 
cannot be separated and both contribute to changing 
the affinity. Note that if the hard bond P was located 
on top of a matching subdomain, the opposite behav- 
ior of the affinity with J would occur. For instance the 
affinity of the pair u/ppPpm(o)-u/ppppm increases with 
large values of J (not shown) . 

To sum up, by combining matching and mismatching 
subdomains and bonds of adjustable stiffness, it is possible 
to obtain a variety of nonmonotonic behaviors of a ligand 
affinity for its target. It turns out to be possible to control 
the mutual affinity by acting on the rigidity of selected 
bonds, mimicking possibly the influence of a cofactor (or 
effector) involved in some allosteric mechanism. 

6 Selectivity and information decoding 

Let us consider two persons A and B willing to communi- 
cate, and in possession of a number N of specific ligand- 
receptors pairs. A sends a ligand « to B, and B brings 
this (still unknown to him) ligand in contact with all the 
receptors available in his library. By using an analytical 
tool (fluorescence, quartz crystal microbalance, resonant 
surface plasmon absorption. . . ) B determines the label i 
of the ligand he has received. By referring to a preestab- 
lished codebook, shared with A, B determines the content 
of the message. Information is conveyed to B by means of 
selective binding [20 . 

Information is coded in the shape of molecules, and es- 
pecially biomolecules. When the biomolecule is in solution, 
this information about shape and chemical composition is 
made available to other molecules and a few of them will 
be able to selectively bind to it. Any selective adhesion 
process can be considered as information reading, or to 
be more precise, information decoding. 

There are a number of issues raised by this medium 
of communication. One may be concerned by the reliabil- 
ity of the message transmission, directly connected to the 
absolute selectivity properties of the ligand-receptor pairs 
§a- One may also asks oneself whether there are physi- 
cal bounds to the minimal binding energy or contact area 
necessary to ensure a transmission error rate lower than a 
predeflned threshold value pT[[^ . 

Coding and decoding information is every day's con- 
cern for radio engineers, and many coding schemes have 
been developed in order to safely carry information around. 



The simplified model discussed in the previous section es- 
tablishes a convenient connection with the realm of digital 
information, as ligand and receptors are already described 
in terms of binary chunks of information. 



6.1 Dependence in the number of mismatches 

I now consider the selectivity of L =u/pppp (c) , in contact 
with altered conflgurations Wi =u/pmmp and 
W2 =u/pmpin. Both altered patterns differ from the origi- 
nal one respectively by one and two values of the spin Sj 
(Hamming distances dn — 1 and dn = '2,, cf Figure [T7|) . 
However the Hamming distance relative to the stiffness 
pattern is = 2 for both Wi and W2 and is not directly 
related to the Hamming distance of configurations. 

Results are shown in Figure [TB] It seems that the se- 
lectivity bias is indeed lower for dn = 1 than it is for 
dn = 2. One expects two qualitatively different limits for 
the selectivity dependence in the number of mismatches. 
If the stiffness J is larger than the contact energy A, the 
selectivity is likely to correlate better with the Hamming 
distance d'^ between stiffness patterns: one can flip an en- 
tire interval of spins by changing only two characters in 
the stiffness pattern. In the opposite limit, the distance 
between configurations dn should be dominant. Ref. |23| 
deals in details with related issues. 



6.2 Selectivity of finite length symbols 

We calculated the absolute selectivity (eq. [TT|) among the 
subset of all n-long ligand and receptor symbols, to deter- 
mine the true amount of information that such a family 
of ligands may carry. 

To be more precise, each ground state conformation 
L among the 2"~^ possible ones (due to up-down degen- 
eracy) was compared with its matching counterpart M 
along with all the 2"^^ — 1 other competitors W, to de- 
fine a shape-dependent gap of selectivity §a('C), eq. PT|) . 
In this procedure, all bonds have the same stiffness j and 
the receptors are considered rigid, fc = 10. This shape- 
dependent gap of selectivity was subsequently minimized 
with respect to all different possible L , resulting in a quan- 
tity Gap(a, J, n) characteristic of this family of ligands. In 
particular, Gap(a, j, n) provides a lower bound for the oc- 
currence of false positive if a ligand is put in contact with 
an equal number of all possible receptors. Using eq. (|12p 
instead of eq. ((TT|) leads to a different quantity Gap'(a, j, n) 
related to the noise to signal ratio when transmitting a 
message using a library of compounds as suggested earlier 
in this section. 

Finally, the lower bound Gap(a,j, n) has to be opti- 
mized with respect to the internal parameters a and j, 
in order to assess, in a context independent manner, the 
intrinsic coding capacity of a family of ligands with given 
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length n, defining in this way GAP(n). The resulting for- 
mulas are: 



Gap(a,j, n) = inf Sa('C) 

ligands C of length n \ 

GAP(n) = sup Gap(a, j', n) 

a.j 



(28) 
(29) 



sup inf 

n i \ ligands C of length n 



Sa(£) 



The inf operator defines the selectivity bias of a matching 
receptor competing against a league of all other receptors. 
The sup operator maximizes Gap with respect to a and j . 

In practice, cyclic chains were preferred to open chains, 
as they confer to all monomers the same importance. This 
optimization was performed by exhaustively scanning a 
rectangular grid of values (a, j) S [1, 9] x [1, 9] with step 0.1, 
k being kept equal to 10. Our findings are rendered as a 
2d plot of Gap(a, j, n — S)vs (a, j) (Figure [T9|) . and shows 
that the gap increases with both a and j, but grows only 
marginally beyond a,j ~ 2. Thus, we took as representa- 
tive the maximum obtained for (a = 8.8, j = 8.8) for es- 
timating GAP(n). This arguably constitutes only a rough 
estimate of the sup^, j operator of eq. (|29p . Similar results 
were observed for all the values of n = 3, ... 8 considered 
in this work. Values of ln[GAP(n)] are reported in Ta- 
ble [TJ In [GAP] decreases almost linearly with n (Table [ij 
as the number of decoys grows exponentially. Extrapolat- 
ing to large n, GAP(ri) vanishes for n > n* ~ 20, length 
for which, in the presence of an equal number of all re- 
ceptors, a ligand has more chance to bind a mismatching 
receptor than its own complementary one. 

If one uses eq. (|12p as an alternative definition of GAP, 
one observes a saturation of this quantity to a n indepen- 
dent value, close to e^-^^ ~ 76 (for a = 8.8, j = 8.8, 
k = 10, which are quite large values). This means that, 
irrespective of their length, any matching ligand receptor 
pair L — M. has an affinity larger by a factor 76 than any 
other mismatching pair L — W. Selectivity is not sen- 
sitive to the growing number of decoys (or "complexity" ) 
with n. 

One learns from these results that if one wishes to 
preserve a minimal selectivity while increasing the ligand 
length n, one must necessarily select a subset of shapes as 
valid ligands and disregard the other possibilities. This is 
tantamount to introducing "redundancy" in the "coding 
scheme" . Note that the numerical value of Table[T]depends 
on the details of the a,j maximization and on the choice 
of fc. 



6.3 Sensitivity to stiffness profiles 

The following step is to investigate the importance of stiff- 
ness patterns compared with shape patterns. Is it possible 
to encode information in the rigidity profile ? This situa- 
tion is motivated, for instance, by the bioinformatic study 



of Sacquin-Mora et al. who investigated correlations be- 
tween mechanical properties and binding location in pro- 
teins |24j . It would be indeed very interesting to determine 
whether self assembling pairs possess some kind of tactile 
sense and are sensitive to their respective local surface 
rigidity. 

We consider four identically shaped ligands and recep- 
tors with patterns Z =u/PPppPP(c) (ligand), u/PPppPP, 
u/PPPPpp,u/PpPpPp and u/ppPPpp (receptors), where p 
stands for a soft coupling and P for a stiff coupling (Fig- 
ure HOI)- 

The corresponding affinities are shown in Figure 1211 
The graph shows that the best affinity is observed when 
both patterns coincide (self) and the lowest when the stiff- 
ness pattern is opposite to the ligand (opposite). The mis- 
match u/PPPPpp is almost optimal and the intermediate 
case u/PpPpPp gives an intermediate value. The thermo- 
dynamic study shows that in both situations, the entropic 
contribution is dominated by the enthalpic contribution 
(Figures [221 [H] and [M)) . The stiffness patterns act on 
both enthalpic and entropic terms and the issue of the 
competition is not simple to predict. For instance, ligand 
L =u/PpPpPp(c) has a stronger affinity for W :=u/PPppPP 
than for its self pattern M (not shown). 

If these results show that the affinity is sensitive to the 
stiffnesses, encoding information in stiffness profiles does 
not seem as straightforward as it is for shapes. Convey- 
ing information by means of selective binding, as sketched 
earlier in this section, would imply to design a quadru- 
plet £i, £2, M3, M4 of ligands and receptors with identical 
shapes but distinct rigidity profiles, with affinities obey- 
ing: 



(Da pa 



pa 

pa pa 



pa 



M2 

,Mi 



(30) 



We could not come up with such a quadruplet. These 
quadruplets may not exist, or require longer lengths that 
the ones considered in this work (say n > 8). Stiffness 
profiles modulate the recognition process, but we were not 
able to prove that they could be substituted to shape pro- 
files. We believe it is still an open issue to know for sure 
if stiffness profiles alone can bear some information. 



7 Conclusion and Perspectives 

It is clear that a ligand with too few degrees of freedom 
cannot achieve good selectivity. In the top of Figure [551 
a fine receptor is in contact with a coarse ligand, which 
averages out the details of the receptor. If one changes one 
bit of the receptor, the binding properties of the ligand 
are only marginally altered, and the selectivity ratio stays 
close to 1. 

At the opposite, a ligand with many monomers in con- 
tact with each element of the receptor, will have a greater 
tolerance to the fluctuations of a particular monomer (Fig- 
ure US bottom). Binding is enhanced by the addition of 
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every monomer contribution. This case is a close anal- 
ogous to the so-called "repetition code" in information 
theory. The repetition code consists in repeating many 
times every single bit of information to ensure the safe 
transmission of a code word (the message). It is a greedy 
procedure, as the length of the message is increased by the 
same factor. Here, a greater selectivity is achieved, at the 
expense of a greater complexity of the molecule £ which 
is thrice as long. As a rule, repetition is the easiest way 
one can come up with to amplify the trends observed in 
Figures [S] and [T^l By glueing together copies of the same 
patterns, one automatically enhances the characteristics 
of the ligand- receptor repeated unit. 

To summarize a comparison with the results presented 
in [TTlfT^fT^ . we can say that we focussed mostly on in- 
teresting characteristic trends exhibited by a few selected 
pairs of ligand and receptors, while the authors of [TTl 
[T^fTB] favor global and averaged trends running over the 
whole set of possible patterns. Our definition of the Gap 
and GAP indicators differ from the free energy difference 
AF which serve as a criterion in their work. Our transfer 
matrix approach treats exactly the binding statistics of a 
given pair of ligand and receptor, without need of Monte- 
Carlo sampling, nor mean-field or large J approximations 
that are required in their 2d approach. Finally we believe 
that our model is the first one that makes it possible to 
investigate the role of local stiffness modulation, and the 
possibility of stiffness encoding of information. 

The connection between error-correcting transmission 
codes and spin systems was recognized by N. Sourlas [251 
1261127) . In particular, it was shown that the usual binary 
parity checks 6i © 62 ® . . . © 6p which involves the sum of 
p binary digits modulo 2 was in fact equivalent to cou- 
pling p spins S1S2 • • • Sp, with Si = ±1, leading to a formal 
connection between information theory and p-spin glasses. 
Our model is currently restricted to nearest neighbors cou- 
pling, and cannot account for long range couplings. Infor- 
mation redundancy is thus limited to simple repetition 
codes. It would be interesting to investigate how a second 
layer of spins and couplings could be added in order to 
better enforce robustness with respect to single bit mis- 
matches. One also notices that a 2d generalization of the 
ligand shape would indeed be equivalent to a genuine 2d 
Edwards- Anderson spin glass. Spin glasses are well known 
for their long-lived or metastable states [TSII^ . Each one 
of these states can be put in correspondence with a match- 
ing random- field representing a different receptor. 

Another interesting connection between spin systems 
and pattern recognition, is the Superparamagnetic cluster- 
ing of data [29i„30j . Wiseman, Blatt and Domanyi showed 
that it was possible to train a two dimensional array of 
Potts spins in order to recognize picture features and pat- 
terns (2d inhomogeneous distributions of points). This 
work could provide hints on how to train a 2d elastic net- 
work for shape and stiffness recognition. 

The nonmonotonic behaviors of the affinity, or the de- 
crease of affinity upon local stiffening of the chain are still 
modest, showing only a cut by half in the case illustrated 



in Figure (jl4p . One may be interesting in finding stronger 
effects by hardening more than a single bond. 

The schematic model introduced and studied here is in- 
tended to guide us towards more realistic examples, such 
as simple molecules that could be designed and investi- 
gated with the help of coarse grained or all atom numer- 
ical models. This, we believe, should be the next step to 
endeavor. 
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Fig. 1. Picture of the ground state of a ligand molecule 
L =u/ppp(o) and a matching receptor M =u/ppp. With 4 
monomers, the flexible ligand explores 2'* — 16 configuration 
states, while the rigid receptor does not change its conforma- 
tion. On the right, a mismatching pattern W =u/pmp is shown. 




(J 2000 




j=exp(|3J) 



Fig. 4. Affinity C of ligands L =u/ppp(o) and L' =u/pppp(c) 
with a mismatching rigid receptor (decoy W =u/pmp. Figure [T] 
on the right). The stiffness j decreases the affinity of the cyclic 
pair while the affinity of the open chain decreases shortly after 
a maximum. Here a = 3. 




j=exp(|3J) 

Fig. 2. Affinity 6" of open L =u/ppp(o) and cyclic 
L' =u/pppp(c) with their rigid matching receptor M, (Figure[T] 
on the left) as a function of the ligand stiffness parameter j, 
with a = 3. Here stiffness is shown to enhance the affinity of 
the pair. A ligand whose native state matches the binding site 
of a receptor is said to be pre-organized for matching. 



= AU 
--TAS 




= AU 
--TAS 



j=exp(|3J) 



Fig. 5. Thermodynamics of £ =u/ppp(o) on receptor 
W =u/pmp, a = 3, expressed in units /3~^ = ksT. 



j=exp(pj) 

Fig. 3. Thermodynamics of u/ppp(o) on its matching recep- 
tor, a — 3, expressed in units I3~^ — kT 
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j=exp(|3J) 



Fig. 6. Selectivity §r between ligand C =u/ppp(o), 
=u/pppp(c), their matching rigid receptor M =u/ppp and 
a mismatching receptor W =u/pmp, as a function of the ligand 
stiffness j (see eq. I10|l . §r is directly related to the probability 
of the ligand to bind the decoy instead of the matching pat- 
tern. As expected Sr = 1 when j = 1 because a completely soft 
ligand binds equally well any receptor (non specific binding). 
Here a = 3. 



■(J 2000 - 




j=exp(pj) 



Fig. 9. Nonmonotic behavior of the affinity C vs j for 
£i =u/ppp.ppp(o), M =u/ppimnppm, k — 10. The effect is more 
pronounced for larger values of a. 





Fig. 7. Ground state shape of flexible cyclic ligand 
£ =u/pppp(c) and flexible receptor M =u/pppp. The 
4 monomers ligand and receptor are treated on the same foot- 
ing, and the partition sum encompasses 4* = 256 configuration 
states, which is done by tracing the fourth power of a transfer 
matrix. 





Fig. 8. Ground state shape of ligand L =u/ppp.ppp(o) and 
motif M =u/ppimnppm, involved in Figure 0. The dot stands 
for a very weak coupling constant between monomers 4 and 5. 




Fig. 10. Ground state shape of fiexible ligand 
L =u/ppppppp(o), and rigid receptor W ^u/pimnpmmp 
involved in Figure (|11[) . 



(_) 2000 - 




j=exp(|3J) 



Fig. 11. Nonmonotic downhill behavior of the affinity C" vs 
j for L =u/ppppppp(o) , M =pmiiipminp, k = 10. The effect is 
more pronounced when a increases. 
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j=exp(|3J) 



Fig. 12. Nonmonotic behavior of the affinity 6" vs j for 
£ =u/ppppppp(o) , M =u/piimipmmp, a = 2. The effect disap- 
pears when the flexibility k of the receptor decreases and the 
limit case k — 1 corresponds to a very soft, patternless recep- 
tor. 





Fig. 13. Ground state shape of cyclic ligand L =u/ppPppp(o) 
and motif M =u/ppmmp involved in Figure 1141 The ligand pos- 
sesses a hard bond (thick line) between monomers 3 and 4. This 
hard bond can be understood as being caused by the presence 
of an effector E that bridges the two neighboring beads 3 and 4. 



j=2 (open) 
j=2 (cyclic) 
j=1.5 (open) 
j=3 (open) 



U2 




j=exp(pj) 



Fig. 14. Affinity C" as a function of the stiffness exp(/3J) of 
the hard link, with j = exp(/3J) = 2 and a = exp(/3^/2) — 3. 
Because the stiff bond of the ligand lies on a mismatching re- 
gion, its hardening leads to a decrease in the affinity. Assuming 
that an effector molecule is the cause of this bond stiffening, 
as sketched in Figure 1131 one is left with a situation that re- 
minds the proposed mechanism of Hawkins and McLeish. At 
the opposite, when the stiff bond lies on a matching region, its 
hardening leads to an increase of the affinity (not shown). 




j=exp(PJ) 



Fig. 15. Configurational entropy change —TASc upon bind- 
ing, when the stiffness of the bond linking bead 3 and 4 is 
increased (Figures [TJ] and I14|l . The vertical dashed lines indi- 
cate the points where all bond stiffnesses are identical {j — j). 
One can infer from these curves the relative variation of config- 
urational entropies A AS caused by a change in j. For instance 
AAS is positive (binding is favored) for j > j = 1.5. The trend 
is reversed for j > j = S., and unsettled for j > j ~ 2. 
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1,1 



3 u/pppp(c) - u/pmpm 
a u/pppp(c) - u/pmmp 



j=exp(|5J) 

Fig. 16. Energy (enthalpy) change AU upon binding, when 
the stiffness of the bond linking bead 3 and 4 is increased 
f Figs [THl and 1 14|l . The vertical dashed lines indicate the points 
where all bond stiffnesses are identical {j — j). The relative 
energy change AAU is positive (binding is unfavored) for j = 
1.5 and j — 2, but slightly negative for j — 3. 




j=exp(|3J) 



Fig. 18. Selectivity §r for Hamming distance dn ~ 1 (straight 
lines) and dn = 2 (circles). The selectivity of the pair 
u/pppp(c) -u/pmpm is higher than the one of u/pppp(c) -u/pmmp, 
which means that u/pppp(c) binds u/pmmp (dn ~ 1) better 
than u/pmpm (d_r/ = 2). Other parameters are a = exp(/3yl/2) — 
3 and fc = exp{/3K) = 10. 




Fig. 19. Three dimensional graph of the relative selectivity 
ln[Gap(a, j, 8)] as a function of a and j, for patterns with 
length 8. Selectivity increases slowly with a and j. Receptors 
are rigid, k = 10. 
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(a) 



(b) 






> u/PPppPP(c)-PPppPP 
= u/PPppPP(c)-PPPPpp 
. u/PPppPP(c)-PpPpPp 
, u/PPppPP(c)-ppPPpp 



(c) 



(d) 



Fig. 20. A ligand and four receptors with same shape 
but different rigidity patterns are represented. The ligand is 
L =u/PPppPP(c), made of two rigid bonds, two soft bonds 
and then two rigid bonds (top). The receptors are (a) mid- 
dle left u/PPppPP, (b) middle right u/PPPPpp, (c) bottom left 
u/PpPpPp, (d) bottom right u/ppPPpp. 



u 




u/PPppPP(c) - PPppPP 
u/PPppPP(c) - PPPPpp 
u/PPppPP(c) - PpPpPp 
u/PPppPP(c) - ppPPpp 



j=exp(|3J) 

Fig. 21. Affinities C° for the four pairs introduced in Figure [20l 
as a function of j (stiffness of the soft bonds of the ligand, 
symbol p,m), see text for details. Parameters are a = 2, k = 2, 
fc = 10 (receptor hard bonds with symbol P,M), j = 10 (ligand 
hard bonds with symbol P,M). 



j=exp((3J) 

Fig. 22. Free energy differences in the case of Figure EIJl and 
Figure [21] result from a competition between energy (enthalpy) 
and entropy, where the energy term is larger by a factor 2 
compared with the entropic one. 



u/PPppPP(c) - PPppPP 
u/PPppPP(c) - PPPPpp 
u/PPppPP(c) - PpPpPp 
u/PPppPP(c) - ppPPpp 




exp(PJ) 



Fig. 23. Energy (enthalpy) differences in the case of Figure 1201 
and Figure [21] dominates the thermodynamics of binding. 



u/PPppPP(c) - PPppPP 
u/PPppPP(c) - PPPPpp 
u/PPppPP(c) - PpPpPp 
u/PPppPP(c) - ppPPpp 




j=exp(j3J) 

Fig. 24. Differences in entropy —TASc- The entropy loss is 
minimal for identical stiffness profile and maximal for opposite 
stiffness profile. 
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Fig. 25. A coarse molecule and a fine receptor (top), a fine 
molecule and a coarse receptor (bottom). 



Length n 


In(GAP) 


2 


3.65 


3 


3.22 


4 


2.91 


5 


2.67 


6 


2.47 


7 


2.30 


8 


2.15 



Table 1. Table of selectivities for patterns with increasing 
sizes. See text for details. 



